A Pole–Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data
نویسنده
چکیده
A cascade of two-pole–two-zero filters with level-dependent pole and zero dampings, with few parameters, can provide a good match to human psychophysical and physiological data. The model has been fitted to data on detection threshold for tones in notched-noise masking, including bandwidth and filter shape changes over a wide range of levels, and has been shown to provide better fits with fewer parameters compared to other auditory filter models such as gammachirps. Originally motivated as an efficient machine implementation of auditory filtering related to the WKB analysis method of cochlear wave propagation, such filter cascades also provide good fits to mechanical basilar membrane data, and to auditory nerve data, including linear low-frequency tail response, level-dependent peak gain, sharp tuning curves, nonlinear compression curves, levelindependent zero-crossing times in the impulse response, realistic instantaneous frequency glides, and appropriate level-dependent group delay even with minimum-phase response. As part of exploring different level-dependent parameterizations of such filter cascades, we have identified a simple sufficient condition for stable zero-crossing times, based on the shifting property of the Laplace transform: simply move all the s-domain poles and zeros by equal amounts in the real-s direction. Such pole-zero filter cascades are efficient front ends for machine hearing applications, such as music information retrieval, content identification, speech recognition, and sound indexing.
منابع مشابه
Cascades of two-pole-two-zero asymmetric resonators are good models of peripheral auditory function.
A cascade of two-pole-two-zero filter stages is a good model of the auditory periphery in two distinct ways. First, in the form of the pole-zero filter cascade, it acts as an auditory filter model that provides an excellent fit to data on human detection of tones in masking noise, with fewer fitting parameters than previously reported filter models such as the roex and gammachirp models. Second...
متن کاملFunctionality of Cochlear Micromechanics – as Elucidated by Upward Spread of Masking and Two Tone Suppression
It is generally accepted that the sharp tuning observed at the characteristic frequency (CF) auditory nerve fibre can be attributed to the sharp mechanical response at the corresponding position of the basilar membrane (at the Best Place BP). This observation has resulted in attention being focused on the basilar membrane and away from other micromechanical structures in the cochlea. However, a...
متن کاملThe All-Pole Gammatone Filter and Auditory Models
The All-Pole Gammatone Filter (APGF) is defined by discarding the zeros from a pole-zero decomposition of the Gamma-Tone Filter (GTF) that was popularized in auditory modeling by Johannesma, de Boer, Patterson, and others. Equivalently, the orderN APGF is the Nth power of a filter with a complex-conjugate pair of poles; the GTF has this same set of poles, but in addition has “spurious” zeros on...
متن کاملUsing a Cascade of Asymmetric Resonators with Fast-Acting Compression as a Cochlear Model for Machine-Hearing Applications
Every day, machines process many thousands of hours of audio signals through a realistic cochlear model. They extract features, inform classifiers and recommenders, and identify copyrighted material. The machine-hearing approach to such tasks has taken root in recent years, because hearingbased approaches perform better than we can do with more conventional sound-analysis approaches. We use a b...
متن کاملNonlinear Co 3 . Nonlinear Cochlear Signal Processingand Masking in Speech PerceptionJ
There are many classes of masking, but two major classes are easily defined: neural masking and dynamic masking. Neural masking characterizes the internal noise associated with the neural representation of the auditory signal, a form of loudness noise. Dynamic masking is strictly cochlear, and is associated with cochlear outerhair-cell processing. This form is responsible for dynamic nonlinear ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011